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Abstract. In the present work, we propose a new method aiming at extracting the kinetic Sunyaev-Zel'dovich 
(KSZ) temperature fluctuations embedded in the primary anisotropies of the cosmic microwave background 
(CMB). We base our study on simulated maps without noise and we consider very simple and minimal as- 
sumptions. Our method essentially takes benefit from the spatial correlation between KSZ and the Compton 
parameter distribution associated with the thermal Sunyaev-Zel'dovich (TSZ) effect of the galaxy clusters, the 
later can be obtained by means of multi-frequency based component separation techniques. We reconstruct the 
KSZ signal by interpolating the CMB fluctuations without making any hypothesis besides the CMB fluctuations 
are Gaussian distributed. We present two ways of estimating the KSZ fluctuations, after the interpolation step. 
In the first way we use a blind technique based on canonical Principal Component Analysis, while the second 
uses a minimisation criterion based on the fact that KSZ dominates a small angular scales and that it follows a 
non-Gaussian distribution. We show using the correlation between the input and reconstructed KSZ map that 
the latter can be reconstructed in a very satisfactory manner (average correlation coefficient between 0.62 and 
0.90), furthermore both the retrieved KSZ power spectrum and temperature fluctuation distribution are in quite 
good agreement with the original signal. The ratio between the input and reconstructed power spectrum is indeed 
very close to one up to a multipole £ ~ 200 in the best case. The method presented here can be considered as a 
promising starting point to identify in CMB observations the temperature fluctuation associated with the KSZ 
effect. 

Key words. Cosmology: Cosmic microwave background - Methods: Data Analysis 



1. Introduction 

The Cosmic Microwave Background (CMB) temperature 
anisotropies contain the contribution of both the pri- 
mary cosmological signal, directly related to the initial 
density fluctuations, and the foregrounds amongst which 
are the secondary anisotropies generated after matter- 
radiation decoupling. They arise from the interaction 
of the CMB photons with the matter and can be of 
a gravitational type (e.g. Rees-Sciama effect (Rees & 
Sciama 1968)), or of a scattering type when the matter 
is ionised (e.g. Sunyaev-Zel'dovich (SZ) effect (Sunyaev 
& Zel'dovich 1972) or Ostriker-Vishniac effect (Ostriker 
& Vishniac 1986; Vishniac 1987)). Among all these sec- 
ondary anisotropies, the dominant effect is the SZ effect. 
It represents the inverse Compton scattering of the CMB 
photons by the free electrons of the ionised and hot intra- 
cluster gas. It results in the so-called thermal SZ (TSZ) 
effect whose amplitude is characterised by the Compton 
parameter y (the integral of the pressure along the line of 
sight). The TSZ amplitude thus depends only on the clus- 
ter electron temperature and density distributions. The 
inverse Compton effect moves the CMB photons from the 



lower to the higher frequencies of the spectrum. This re- 
sults in a peculiar spectral signature with a decrement at 
long wavelengths and an increment at short wavelengths. 
When the galaxy cluster moves with respect to the CMB 
rest frame, with a peculiar radial velocity v r , the Doppler 
shift induces an additional effect often called the kinetic 
SZ (KSZ) effect, which generates temperature anisotropies 
with the same spectral signature, at least in the non- 
relativistic approximation, as the primary CMB fluctu- 
ations. 

The interest of the TSZ effect for cosmology has been 
recognised very early (see reviews by IRephaeli (1995)| 



Birkinshaw (1999) and Carlstrom et al. (2002) I. It is 
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a powerful tool to detect high redshift galaxy clusters 
since it is redshift independent. In combination with 
X-ray observations it can be used to determine the 
Hubble constant and probe the intra-cluster gas dis- 
tribution. Moreover, the KSZ effect may be the one 
of the best ways of measuring the cluster peculiar 
velocities by combining thermal and kinetic effects 
HSunyaev fc Zel'dovich 1 980). The advantages of this 
method are: (i) it yields directly the peculiar velocities, 
bypassing the need to measure inaccurate distance 
indicators flFaber fc Tully 1976| |Tully fc Fisher 1977) ; 
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(ii) the method has a physical explanation and (iii) it is 
independent of distance. The KSZ can be distinguished 
from the TSZ effect due to the different frequency de- 
pendence of their intensities. The KSZ intensity reaches 
its maximum at a frequency of ~ 218 GHz, just where 
the TSZ intensity is zero . Hence, this is the optimal 
frequency to the detect the KSZ signal. It has also been 
shown HHobson et al T9981 |Bouchet fc Gispert 19991 

Baccigalupi et al. 2000| IDelabrouille et al. 20021 

Kuo et al. 20021 |Maisinger et al. 20031 ) that the TSZ 
signal can be extracted from the other astrophysi- 
cal contribution by component separation techniques 
(Wiener filtering, Maximum Entropy, Independent 
Component Analysis, ...). Despite the scientific in- 
terest of the KSZ effect as a probe of large scale 
matter distribution and structure formation theories, 
very few measurements of the peculiar velocities were 
achieved (|Holzapfel et al. 1997| ILamarre et al. 19981 
IBenson et al. 2003|l . As a consequence, very few meth- 
ods have been proposed so far to address the specific 
underlying question of separating the secondary KSZ fluc- 
tuations from the primary anisotropics. In an early work, 
Haehnelt & Tegmark (1996) used an optimal filtering 



(Wiener), with a spatial filter derived from X-ray obser- 
vations of galaxy clusters, that minimises the confusion 
with CMB. However, this method implied the knowledge 
of the CMB power spectrum. Aghanim et al. (1997) 
rather used a matched filter optimised on simulated data 
and independent of the underlying CMB model. Recently 



Hobson & McLachlan (2003) presented a Bayesian ap- 



proach for detecting and characterising the signal from 
discrete objects embedded in a diffuse background. 
They showed that this approach is around twice as 
sensitive as the linear optimal filter approach proposed 



by Haehnelt fc Tegmark (1996) 



In the present study, we propose a new method opti- 
mised to extract from the primary anisotropies, the tem- 
perature fluctuations, associated with the KSZ effect. The 
method is based on the fact that we have two sets of maps 
(provided, in a realistic case, by component separation 
techniques), the first set contains both CMB and KSZ 
temperature fluctuations and the second set consists of 
Compton parameter maps associated with the TSZ effect 
which is used as a spatial template. In our study, we do not 
use real (observed) maps but we rather use two sets of sim- 
ulated maps. We were able to retrieve, in the best possible 
way, the amplitude and the distribution of the tempera- 
ture fluctuations associated with KSZ together with the 
associated power spectrum. 

2. Methodology 

In a "real-life" case, it is worth noting that the appli- 
cation of the method we propose here is based on the 
fact that a first-step component separation is performed 
on the CMB data leaving us with a TSZ effect map and 
a temperature fluctuation map containing primary and 
KSZ anisotropies. In the present study, we focus on the 



description of the method and the way it is intrinsically 
limited by the pure cosmological signals primary CMB + 
SZ effect (without adding any instrumental effects). It is 
beyond the scope of this first study to address the instru- 
mental effects (this will be the subject of a future work), 
therefore and as mentioned above, we use simulated cos- 
mological data. Namely, we simulate 15 (512 x 512 pixels) 
primary CMB, TSZ and KSZ maps with a pixel size of 
1.5 arc-minutes. A precise description of the SZ simula- 
tions is given in Aghanim et al. (2001). The KSZ effect 
induces temperature fluctuations that can be written as 
(5£sz = (AT/T)ksz = ~^t, with c and r the velocity of 
light and the cluster Thomson optical depth. The primary 
CMB and the KSZ anisotropies having the same spectral 
shape in the non-relativistic approximation, we construct 
maps of radiation temperature fluctuations, St, by adding 
the two signals S T = (AT/T) K sz + (AT/T) CMB - We are 
thus left with two data sets of pure cosmological signals, 
one consisting of the temperature fluctuation maps (CMB 
+ KSZ) and the other consisting of the Compton param- 
eter maps, y, for the TSZ effect. For this study, we adopt 
a low matter density flat model defined by: fi m = 0.3, Oa 
= 0.7 and h = H /100 km/s/Mpc = 0.65. 

Provided the two types of maps, y maps for the TSZ 
effect (mean a = 1.17 10~ 5 ) and 5 T maps for CMB + 
KSZ, our goal is to obtain the best possible estimate 
of the KSZ. To achieve this goal, we benefit from the 
fact that the TSZ and the KSZ features are spatially 



correlated as already noted by Diaferio et al. (2000) and 
Sorel et al. (2002) The later showed that the absolute val- 



ues of the covariance coefficients between TSZ and KSZ 
maps are significantly high even though the correlation 
coefficient between the maps does not exceed 0.1 in abso- 
lute value. This low value is due to the fact the signs and 
amplitudes of the KSZ anisotropies in a map depend on 
the distribution of the radial peculiar velocities which is a 
random variable with zero mean. The spatial correlation 
between TSZ and KSZ simply means that both effects are 
due to galaxy clusters. Therefore, where the TSZ signal 
is present so are the KSZ fluctuations regardless of their 
signs or amplitudes. Conversely, where the TSZ fluctua- 
tions are absent, so are the KSZ fluctuations and the signal 
at that position in the St map is therefore associated with 
the CMB anisotropies only. 

Our technique to separate the KSZ fluctuations from 
the primary CMB anisotropies is based on this simple 
statement. It allows us to build up a two-step strategy 
in which: (i) we first derive the best estimate of the CMB 
map, and (ii) consequently deduce the best estimate of the 
KSZ map. 



2.1. Estimating the primary CMB anisotropies 

In this section, we address the first step of our separation 
method, namely we derive an estimate (the best possible) 
of the primary fluctuation map. For this we use the two 
observables: The TSZ map and the 5t map which con- 
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tains both the primary and the KSZ fluctuations. Given 
the above mentioned statement on the spatial correlation 
between TSZ and KSZ signals, the basic idea in order to 
estimate the primary CMB anisotropics, is to use the TSZ 
map as a mask to select in the St map, the pixels where 
the TSZ fluctuations are not present, i.e. where only pri- 
mary anisotropics are present. The rest of the pixels in 
the map are missing or masked pixels. We then interpolate 
the 8t signal on these missing pixels with the constraint 
that the pixels where TSZ is absent i.e. with signal as- 
sociated with primary CMB only, keep their values after 
the interpolation is achieved. We therefore end up with 
an estimated primary CMB map where the St signal in 
the masked (missing) pixels is obtained from the inter- 
polation. Formally, the KSZ map can then be estimated 
simply by computing the difference between the original 
unmasked St map and the primary CMB map estimated 
with the interpolation. 

2.1.1. Interpolation 

We already note that the "recovery" of the KSZ map 
heavily relies on the performances of the interpolation 
method. We first use the method described in Unser 
(1995). Consider the problem of the minimisation of a gen- 
eral criterion written as: 

E(u)= ™(k,l)[f(k,l)-u(k,l)] 2 + 

A [dx*<k,l)] 2 +[d v *u(k,l)] 2 (1) 

(fc,z)ez 2 

where / is an input image, u is the desired solution w ^ 
is a map of space- varying weights, d x and d y are the hor- 
izontal and vertical gradient operators, respectively. The 
second space-invariant term in Eq.^is a membrane spline 
regulariser; the amount of smoothness is controlled by the 
parameter A. Taking the partial derivative of Eq. Qlwith 
respect to u, we find that u is the solution of the differen- 
tial equation : 

f w = Wu + XLu = Au (2) 

where W is the diagonal weight matrix, f w — W f the 
weighted data vector, L is the discrete Laplacian opera- 
tor and A — W + XL a symmetric definite matrix. The 
inversion of Eq.[3is achieved using a multi-grid technique 
(Wesseling, 1992). Typically, we need two V-cycles with 
two iterations in the smoothing Gauss-Seidel part of the 
algorithm to reach a residual of the order of 10 -6 . In 
our case, the interpolation of the primary CMB map is 
achieved by setting the weights to zero where the data are 
missing, i.e. in the masked pixels, and to one elsewhere 
and by resolving Eq. [21 The value of A then determines 
the tightness of the fit at the known data points (un- 
masked pixels), while the surface u is interpolated such 
that the values of the Laplacian of u is zero elsewhere. In 



the present work, we impose a low value for A so that the 
recovered values at the known data points are equal to the 
original values. This criterion can be relaxed to take into 
account corruption of the data by additive white noise 
(Unser, 1995). In this case, the optimum regularisation 
parameter A can be defined as: 

A = E(f.Lf)-4<r* (3) 

where a 2 is the variance of the noise and E(f.Lf) de- 
notes an estimate of the correlation between the noisy im- 
age f and its Laplacian Lf. In the other cases (non white 
noise) , the optimal regularisation parameter A may be de- 
termined from the data using cross-validation methods 
( Wahba 1977), or from a given measurement model of the 
signal + noise HReeves 1994j) . 

It is possible to improve the performances of the inter- 
polation, and hence of the retrieved KSZ map, by setting 
non-zero values to the Laplacian of u at the missing data 
points (which are set to zero in the original method). The 
values we set for the Laplacian of u are such that the 
first and second derivatives of the interpolated signal are 
continuous throughout the interval. These continuity con- 
ditions characterise the cubic B-spline functions which are 
known for their simplicity and their performances in terms 
of signal reconstruction (Unser et al., 1993; Thevenaz et 
al. 2000). In practice, these additional conditions imply 
that the source term f w in Eq. |2 is modified to impose 
non-zero values at the points where the weights are set 
to zero (i.e. the missing data points). An equivalent way 
to solve Eq. |21 with the above mentioned conditions, is to 
replace the Laplacian operator L, by the quadratic oper- 
ator L 2 . These two interpolation methods are obviously 
not the unique techniques and other techniques (based on 
textures for example) can be used. In the following we 
will only test the two operators described above and then 
choose among them, the one which gives the most satis- 
fying results. 

2.1.2. Defining the mask or missing pixels 

We must now define more precisely what we mean when 
we state where the TSZ is not present, or in other terms 
how do we select the missing data points? Besides the 
pixels that actually contain no galaxy clusters, i.e. no SZ 
contributions, this statement means that we fix a thresh- 
old value for the TSZ amplitude below which we consider 
the TSZ signal is too small to be detected. The corre- 
sponding pixels in the St maps are then considered to be 
associated only with the primary CMB signal. On the con- 
trary, above this threshold the corresponding pixels in the 
St map are considered to be the missing data points, i.e. 
masked pixels that we want to interpolate. It is clear that 
the number and location of the missing data will depend 
on the threshold. The lower it is, the larger the number of 
missing data we need to recover. The choice of this thresh- 
old has also important consequences on the quality of the 
interpolation. 
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Fig. 1. The correlation coefficient between the original 
KSZ map and the series of 17 estimated KSZ maps as a 
function of the standard deviations of the estimated KSZ 
maps. Upper panel is for the best case and lower panel for 
the worst case. The triangles and the solid line stand for 
the interpolation with the biharmonic operator, the dia- 
monds and the dashed line are for the interpolation with 
the Laplacian. The interpolation with the biharmonic op- 
erator gives better results especially for the KSZ maps 
with low standard deviation. The vertical lines mark the 
standard deviation of the original KSZ maps (2.6 10 -6 and 
1.2 10 -6 ). The standard deviation of the primary CMB is 

1.9 icr 5 . 



When the threshold is high, on the one hand, the num- 
ber of missing data is small and the interpolated surface 
is good. On the other hand, the selection retains only the 
clusters with the highest TSZ and misses the majority of 
clusters. In this case, we expect to end up with a low cor- 
relation coefficient between the retrieved and the original 
KSZ maps. When the threshold is low, we take into ac- 
count a majority of clusters, but the interpolated surfaces 
are large and the quality of the interpolation suffers from 
that. Moreover, the characteristic scale of the interpolated 
surfaces becomes, in this case, of the order of that of the 
CMB fluctuations, leading to "confusion effects" in the in- 
terpolation. From these remarks, we can infer that: Firstly, 
there will exist an optimal threshold value for which the 
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Fig. 2. For the best case (upper curve and triangles) and 
worst case (lower curve and diamonds), the correlation 
coefficient between the original and the 17 estimated KSZ 
maps as a function of the associated 17 TSZ threshold 
values. The highest threshold value is of the order of y = 
3.5 10 -5 . The interpolation method uses the biharmonic 
operator. 



correlation between the retrieved and the original KSZ 
maps is maximum. Secondly, the restoration of extended 
clusters is likely to be of low quality as already noted by 
|Haehnelt fc Tegmark (1996)| 



Obviously there is no a priori way of choosing the TSZ 
threshold on an objective basis. Indeed, given the TSZ 
map is obtained (in "real-life") from a component sepa- 
ration process involving the true signal but also the in- 
strumental and observational effects, one cannot rely on a 
"theoretical" expected value. Therefore, rather than per- 
forming only one interpolation of the primary CMB map 
for one single TSZ threshold, we propose to retrieve a set 
of interpolated CMB maps corresponding to a set of TSZ 
threshold values. The later can be defined in a simple way 
without any theoretical or observational prior as follows: 
We compute the cumulative distribution function of the 
TSZ values in the given map and we search for the values 
corresponding to 5% to 95% of the total number of pixels 
(with a step of 5%). This gives us a set of 19 threshold 
values such that all pixels in the TSZ map that have y pa- 
rameters above the threshold are identified as the missing 
data points in the simulated St map, i.e. the mask. In the 
present study, we are using simulated TSZ maps, i.e. with- 
out noise. These maps exhibit a background of zero values 
which proportion represents in our case at least 10% of the 
total pixel number. This characteristics implies that the 
first two TSZ threshold values associated with 5 and 10% 
of the total pixel number are irrelevant. In the following, 
we thus use only the 17 highest TSZ thresholds. 
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2.1.3. Results 

For each of the 15 simulated maps, we obtain 17 TSZ 
threshold values, and thus 17 masked St maps. We apply 
the interpolation techniques (Sect. I2.1.1fl to recover the 
primary CMB signal in the masked regions. For each of 
the 15 simulated maps, we thus end up with 17 estimated 
primary CMB maps corresponding to the 17 TSZ thresh- 
old values. The associated KSZ maps are evaluated simply 
by subtracting the interpolated primary CMB maps from 
the total St map. 

In order to evaluate how well the two interpolation 
methods presented in Sec. 12.1.11 do recover the KSZ sig- 
nal, we compute for each of the 17 KSZ estimated maps 
the correlation coefficient between the original input KSZ 
map and the estimated KSZ maps. The results are shown 
in Fig.^ The data points representing the correlation co- 
efficients are plotted as a function of the standard devia- 
tion of the estimated KSZ map for each of the 17 threshold 
values. The diamonds and the dashed line represent the 
case where the interpolation is such that the Laplacian 
values are set to zero, and the triangles and the solid line 
are for the case in which the Laplacian values are non-zero. 
The upper panel in Fig. ^ shows our best recovery case in 
terms of correlation coefficient. The lower panel is for our 
worst case. The correlation coefficients between the origi- 
nal input KSZ map and the 17 estimated KSZ maps are 
also displayed as a function of the 17 TSZ threshold values 
in Fig. [5] It is worth noting that the high TSZ thresholds 
(abscissae in Fig. |2J correspond to low KSZ standard de- 
viations (abscissae in Fig.^l. 

First, we note from Fig. ^ that for any standard de- 
viation of the estimated KSZ map, the correlation coeffi- 
cient between the original and the estimated KSZ maps 
is higher when the Laplacian values are non-zero than 
when they are set to zero. Actually there can be a signif- 
icant improvement in the KSZ reconstruction if an opti- 
mised interpolation method is used. This is especially true 
for the maps with low standard deviations. The improve- 
ment brought by the biharmonic operator is of the order of 
20% in our worst case (Fig.^ lower panel). We will there- 
fore use, in the following, the most powerful interpolation 
method that is the one with the L 2 operator. 

Second as expected, the correlation coefficient in- 
creases when the TSZ threshold decreases as shown in 
Fig. [2] (i.e. when the standard deviation of the estimated 
KSZ map increases in Fig. ^) . The correlation coefficient 
reaches a maximum value and then it decreases for the 
lowest TSZ thresholds (i.e. the highest KSZ standard de- 
viations). Moreover, we note that among the set of 17 
KSZ estimated maps the one with the highest correlation 
coefficient is shifted towards lower values proportionally 
to the standard deviation of the original KSZ map. We will 
use this behaviour later on in the minimisation procedure. 



2.2. Reconstructing the KSZ map 

In the previous step, we have interpolated the St signal 
to estimate the primary CMB map and then extract the 
KSZ signal as a function of a set of TSZ thresholds. We 
have tested the performances of two interpolation methods 
by comparing through a correlation coefficient each of 
the 17 estimated KSZ maps, corresponding to the 17 TSZ 
threshold values, to the original input KSZ map which, 
of course, we do not have in "real life". The set of 17 
estimated KSZ maps were obtained by subtracting the 
interpolated primary CMB maps from the total St map 
of the temperature fluctuations. 

In this step, using the set of 17 KSZ estimated maps 
associated with the set of 17 TSZ thresholds, we search 
for a method that gives us either the reconstructed KSZ 
map which is the closest to the original KSZ signal or even 
better, the combination of the set of KSZ maps giving the 
best estimate of the original KSZ map. In the following, 
we have explored two ways to achieve this goal (restricted 
to linear combinations only). The first one is to decorre- 
late the set of images by a canonical Principal Component 
Analysis (PC A), the second way is to minimise a criterion, 
which in our case is related to the non-Gaussian character 
of the KSZ signal. 

2.2.1. Decorrelation with Principal Component 
Analysis 

It is obvious from our definition of the masked pixels (Sec. 
12.1. 2fl that all the interpolated maps (defined by the set 
of 17 TSZ thresholds) are highly correlated. The first and 
natural approach to decorrelate them is thus to use a 
PCA method. As noted in Sec. 12.1.21 decreasing the TSZ 
threshold increases the characteristic scale of the struc- 
tures we have to interpolate. This means that the confu- 
sion between the extended clusters and the primary CMB 
anisotropics increases. As a consequence, decreasing the 
TSZ threshold increases the proportion of the St signal 
due to the primary CMB in the estimated KSZ maps. In 
this context, the purpose of performing a PCA is to decor- 
relate the signal due to the primary CMB from that due to 
the galaxy clusters. We expect to find the high frequency 
part of the KSZ effect in one principal component, and in 
a second principal component, the low frequency part of 
the KSZ signal (essentially the extended clusters) together 
with the contribution from the primary CMB fluctuations. 

For each of the 15 simulated maps, we apply the PCA 
to the set of 17 estimated KSZ maps. We find that the 
first and second principal components represent respec- 
tively ~ 70% and ~ 15% of the total input signal. In our 
approach, it is the first principal component which stands 
for the KSZ reconstructed signal. It is thus interesting 
to evaluate how well the PCA performs the reconstruc- 
tion. To do so, we compute for each of the 15 simulated 
maps the correlation coefficient between the first prin- 
cipal component and the original input KSZ map. The 
correlation coefficient averaged over the 15 maps reaches 
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0.73 which is satisfactory. However, the standard devia- 
tion of the first principal component is on average smaller 
by almost 50% than the standard deviation of the original 
KSZ signal. The PCA method clearly underestimates the 
reconstructed KSZ signal which is an obvious weakness 
of the method. We will thus investigate in the following 
minimisation methods. 

2.2.2. Statistical minimisation 

By minimising on the known KSZ signal (from our dataset 
of 15 input maps), we can first search for a linear combina- 
tion of the set of 17 estimated KSZ maps that is the clos- 
est to each original KSZ in the sense of least squares. This 
has been done using a standard singular value decomposi- 
tion IjPress et al. 1992). We compute again the correlation 
coefficient between the original KSZ map and the recon- 
structed map obtained from the minimisation to estimate 
the power of the method. We find an average correlation 
coefficient (over the 15 simulated input maps) of 0.8, only 
slightly higher than the PCA result of 0.73. However, the 
standard deviations of the reconstructed maps are again 
significantly lower than that of the original KSZ maps by 
almost 25 % on average (better than in the PCA case). 
Furthermore, the results of the least square minimisation 
depend strongly on the set of estimated maps that are 
used which is clearly undesirable. 

In order to avoid this problem and to obtain as more 
map-independent results as possible, we must identify a 
trustful criterion to minimise on. The latter should ideally 
give at the same time a result that is the closest possible 
to the largest correlation coefficient of 0.80 on average 
(obtained with the least square minimisation), and recon- 
structed KSZ maps with the closest possible standard de- 
viations to those of the original KSZ signal. Moreover, 
a good minimisation criterion would be a criterion that 
characterises the KSZ signal only, excluding the primary 
CMB signatures. We have identified two properties of the 
KSZ fluctuations that fulfill this definition: 

— The KSZ signal dominates the primary CMB at high 
wave numbers (small angular scales). 

— The KSZ effect is a highly non-Gaussian process con- 
trary to the primary CMB which is a Gaussian process. 

The analyses of the available CMB 
data QCayon et al. 2003| IKomatsu et~al~2 003 

ISantos et al 2003 1 all seem to agree on the fact that 
primary CMB anisotropies are Gaussian distributed 
as expected from the simplest inflationary models. By 
contrast, the SZ effect is definitely characterised by its 
non-Gaussian signatures. Using wavelet analysis, we have 
demonstrated (Aghanim & Forni 1999; Forni & Aghanim 
1999), that the excess kurtosis of the wavelet coefficients 
allows us to discriminate between a Gaussian primary 
CMB signal and a non-Gaussian process like the SZ effect. 
We note though that a recent re-analysis of the WMAP 
data (Vielva et al. 2003) suggests that the observed 
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Fig. 3. Standard deviations of our set of 15 KSZ original 
simulated maps (triangles) as compared to the standard 
deviations of the 15 reconstructed KSZ maps (squares). 
The reconstruction is based on the minimisation of the 
statistical criterion (see text). 



CMB anisotropies exhibit non-Gaussian signatures. The 
analysis indicates that the deviations from Gaussianity 
concern large scales (a few degrees). Such a behaviour 
if confirmed would therefore not affect our minimisation 
criteria since the latter is based on the characteristics of 
the signal at small angular scales (a few to a few tens of 
arc- minutes). In particular, the statistical properties of 
the wavelet coefficients at the lowest decomposition scale 
(3 arc-minutes) reflect the properties of the SZ effect 
only. This is due to our choice of the wavelet basis and of 
the decomposition scheme which focus on the scale where 
the SZ signal dominates over the primary CMB. Within 
this choice, the wavelet analysis provides us with the 
wavelet coefficients associated with diagonal, vertical and 
horizontal details in the analysed map. Finally, we have 
shown in Aghanim & Forni (1999) that in the case of the 
SZ effect the diagonal details are by far the most sensitive 
to the non-Gaussian signatures (recently confirmed and 
explained by Starck et al. (2003)). 

In Table we compare, using the 9/7 bi-orthogonal 
filter bank of IjCohen et al. 199"0)l and for our worst and 
best cases the statistical properties (standard deviation, 
skewness and excess kurtosis) of the diagonal details of the 
KSZ maps and CMB+KSZ maps at the first decomposi- 
tion scale (3 arc- minutes). We also give the values of the 
three quantities for the primary CMB maps. We immedi- 
ately note that both sets of wavelet coefficients for KSZ 
and KSZ + CMB share the same statistical properties and 
are quite different from those of the primary CMB alone. 
This confirms that not only the KSZ signal dominates 
over the primary CMB (same standard deviation, i.e. 
same power), but also that the non-Gaussian signatures 
in the KSZ + CMB maps are associated with the KSZ 
effect (same skewness and excess kurtosis). The above 
mentioned properties characterise, in a mixture of CMB 
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Standard deviation 


Skewness 


Excess kurtosis 


KSZ+CMB 


6.45 10" ' 


0.10 


8.71 


KSZ 


6.45 lCT 7 


0.10 


8.72 


KSZ+CMB 


2.05 10"' 


0.22 


8.97 


KSZ 


2.09 10 -7 


0.23 


9.15 


CMB 


1.60 10" s 


-0.02 


0.45 



Table 1. The statistical properties of the first scale (3 arc- minutes) diagonal wavelet coefficients distribution for the 
St map (KSZ + CMB), the KSZ map and the primary CMB alone. The two cases stand for our best case (first pair) 
and the worst case (second pair). We note that the three moments are almost identical and characterise well the KSZ 
fluctuations; they are very different from the CMB fluctuations properties. 



+ KSZ fluctuations, the KSZ effect only. Consequently, we 
can confidently minimise on them. In practice, we choose 
the following criterion : 



C = Min[ 



(M 2 (w Q ) - M 2 (w)) 2 , (Mitw ) - Mi{w)f 



M 2 2 (w ) 



M\{w Q ) 



1(4) 



where wq is the distribution of the diagonal wavelet co- 
efficients for the known St map (KSZ + CMB) and w 
is the distribution of the diagonal wavelet coefficients for 
the desired solution map. A^2 and M.4 are respectively 
the second and the fourth moments of the wavelet coeffi- 
cients. This criterion has thus the advantage of taking into 
account both the energy (or power) content of the coeffi- 
cients, through the second moment, and the non-Gaussian 
character, through the fourth moment. We have chosen 
the fourth moment to characterise the non-Gaussian prop- 
erty because it is the moment for which the KSZ signal 
is the most sensitive to non-Gaussianity as shown by 



the hydro-dynamical simulations of da Silva et al. (2001) 



Clearly, we might also include the third moment of the 
wavelet coefficients to the criterion. This would be needed 
in particular if we were dealing with a "skewed" signal 
such as distorted CMB anisotropies by the weak lens- 
ing of large scale structures. Taking the fourth moment 
in the minimisation criterion allows us in turn to focus 
on the reconstruction of KSZ maps excluding the signal 
that might contribute at that particular angular scale. In 
our minimisation criterion, we have added the second and 
fourth moments quadratically. We have thus attributed 
equal weights to the power and to the non-Gaussian char- 
acter of the KSZ signal. It is possible to envisage a different 
weighting of one term or the other in Eq. 0] This would 
in principal enhance the non-Gaussian signal, for exam- 
ple, and thus ease its separation from a Gaussian signal. 
Such a non-quadratic mixture would be particularly useful 
at scales where the KSZ effect is not the dominant pro- 
cess. However, there is a priori no trivial way of setting 
the weights. This possibility should be investigated in the 
future. 

The solution map w is obtained by minimisation of 
the criterion £ over all the combinations of wavelet coef- 
ficients wq allowed by our set of 17 estimated KSZ maps 
(there are 15 simulated input maps). However, there are 
far too many combinations and we therefore choose to 



reduce the number of cases. To do so, we recall the obser- 
vation made in the previous section that the correlation 
coefficients between the original KSZ map and the 17 es- 
timated KSZ maps exhibit a maximum value (see Fig.^). 
In order to reduce the number of combinations for the min- 
imisation, we then adopt the following iterative strategy: 
At each step, we first eliminate the estimated KSZ map 
which contains the highest contribution from the CMB, 
i.e. the one with the highest standard deviation. Then we 
minimise £ using the remaining set of estimated maps. 
Finally, we take as a solution of the minimisation criterion 
the map corresponding to its lowest value. 

In addition to the previous conditions (power and non- 
Gaussian character), we also make use in the minimisa- 
tion process, of a nice property of the wavelet transform, 
which is it preserves the spatial information. In our case, 
this means that we can identify the diagonal wavelet co- 
efficients that are spatially associated with the clusters 
in the TSZ map. Thus instead of minimising over all the 
wavelet coefficients of the data map (wq in Eq.^l, we can 
minimise only over the coefficients corresponding to the 
clusters. This has two advantages; the first is to enhance 
the non-Gaussian character and the second is to reduce 
the influence of other possible non-Gaussian processes that 
could affect the anisotropy map St- 

In Fig.0 we present the standard deviations of the 15 
original simulated KSZ maps (triangles) and of the 15 re- 
constructed KSZ maps (squares) obtained by the minimi- 
sation technique described above. The agreement is pretty 
good even for the maps with the lowest standard devia- 
tions. We find the error on the standard deviation is only 
of the order of ~5%. This is much smaller than what was 
obtained from the PCA method (~ 50 %) or from the 
least square minimisation method (~25%). Furthermore, 
the mean value (over the 15 original maps) of the correla- 
tion coefficient between the original and the reconstructed 
KSZ maps is 0.78. It is slightly better than the value ob- 
tained with the PCA method and quite close to the value 
of 0.80 obtained with the least square method. The quality 
of the KSZ map reconstruction can be observed in Figs. 
0] and [5] which display, for our best and worst cases re- 
spectively, the histograms of the temperature fluctuations 
and the power spectra of both the original (solid line) and 
reconstructed (dashed line) KSZ maps as well as the ratio 
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Fig. 4. Top and middle panels: Histogram and power spec- 
trum of the original KSZ map (solid line), and of the re- 
constructed KSZ map (dashed line). The reconstruction 
is based on the minimisation of the statistical criterion 
(see text). The bottom panel exhibits the ratio of the two 
power spectra. Note the correlation coefficient between 
the original and reconstructed KSZ maps of ~ 0.9 and 
the total power P Tea i and P cst 



of these two power spectra. It is worth noting that the 
ratio is close to one over a large range of multipoles (an- 
gular scales) even in the domain where the primary CMB 
dominates the KSZ signal by orders of magnitude. We 
also notice the correlation coefficients between the origi- 
nal and the reconstructed KSZ maps which reaches ~ 0.9 
in our best case and 0.62 in our worst case. The compar- 
ison between the standard deviations of the original and 
the reconstructed map a Tea i and a cs t also gives a global 
indication on how well the method works. Clearly, the 
method we propose to separate between the KSZ signal 
from the primary CMB anisotropics despite their identical 
frequency dependence allows us to obtain such results be- 
cause we were not only able to estimate correctly the am- 
plitude of the KSZ signal for most clusters but also their 
angular separation as well as the amplitude of the back- 
ground (primary CMB). This is nicely exhibited by the 
superposition of the cuts across the reconstructed (dashed 
line) and the original (solid line) KSZ maps, once again for 




Multipole 1 

Fig. 5. Same as figure^] This is our worst case and corre- 
sponds to the original KSZ map with the lowest standard 
deviation. Note the low correlation coefficient 0.62 

the best and worst cases (Figs.[(|]and[7] respectively). The 
method partially fails to find broad KSZ features due to 
their important level of confusion with the primary CMB 
fluctuations (Sec. I2.1.2J) . Moreover, since the minimisa- 
tion process is an overall procedure, it can occasionally 
happen that relatively large features (i.e of the order of 
10~ 5 in absolute AT/T) are poorly recovered. 

3. Sensitivity test 

We have tested our method to separate the KSZ 
anisotropics from the primary CMB signal on simulated 
maps free of any noise. Moreover, we did not take into 
account other astrophysical contributions than the CMB 
and the SZ effect themselves. In "real-life", the data are 
corrupted by instrumental noise and astrophysical signals. 
Additional noise (whatever its origin) will have a first ef- 
fect of reducing the ratio between the primary CMB and 
the KSZ signals. We test the performances of our method 
to this effect by applying our procedure to one same KSZ 
map that is added to the same primary CMB map. The 
standard deviation of the KSZ signal is reduced while 
the CMB standard deviation is kept the same CMB map. 
This results in lowering the KSZ contribution to the St 
map. We arbitrarily choose to reduce the standard devia- 
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Fig. 6. Cuts across the best reconstructed KSZ map Fig. 7. Same as figure for the worst reconstructed KSZ 
(dashed line) and its original counterpart (solid line). The map and its original counterpart, 
cuts have the same position in both maps. 



tion following a geometrical progression <7j = OQ\f 7 i with 
i = 0, 6 and ao=2.5 10~ 7 . The highest standard deviation 
is then a m ax=2.0 10~ 6 which is a typical value for our 
dataset (see Fig.EJ. 

We also test the sensitivity of our method to the 
wavelet transform entering in the minimisation criterion 
by comparing the results obtained using two different bi- 
orthogonal wavelet bases, the commonly used 9/7 tap fil- 
ter IjCohen et al. 1990|l and the 6/10 tap filter given by 
Villasenor et al. (1995). 

The results for this new set of maps are displayed 
in Table El in terms of the standard deviations of the 
original and reconstructed KSZ maps, and of the corre- 
lation coefficient between the original and reconstructed 
KSZ maps. We first notice that the results do not depend 
much on the wavelet basis. As expected, the quality of 
the reconstruction (given in terms of the correlation coef- 
ficient) increases with the standard deviation of the origi- 
nal KSZ map from 0.5 to ~ 0.8. The smallest coefficients 
are obtained for very low standard deviations (< 10~ 6 ). 
For the KSZ map with the lowest standard deviation, 
the histogram of the reconstructed map (Fig. |S| upper 
panel, dashed line) shows that the smallest temperature 
fluctuations are not resolved, which produces an excess of 



zero values. More generally, the histogram of the recon- 
structed map behaves like a global envelope to the original 
histogram (Fig. |H] upper panel, solid line) which does not 
resolve the details, e.g. the excess of points around At/T 
= 1.3 10~ 6 . In addition, we not an overall raise of the 
wings of the distribution. Also the power spectrum com- 
puted from the reconstructed KSZ map (Fig. 00 middle 
panel, dashed line) presents an excess of power as com- 
pared to the original power spectrum around £ = 2000 
due to the spatial distribution of the unresolved fluctua- 
tions. The latter also causes the lack of power at higher 
multipoles. This behaviour can be also observed in figure 
El which shows a cut across the KSZ maps (solid line for 
the original signal, and dashed line for the reconstructed 
signal,) at the same position but for different standard de- 
viations. It can be noticed that the large scale feature ~ 
50 arc-minutes wide at the centre of the cut is poorly re- 
solved due to confusion with the CMB fluctuations. In that 
worst case, the amplitude of the estimated signal remains, 
however, proportional to the input signal, but generally 
the estimation becomes better with increasing standard 
deviation of the input signal, as it can be seen for the ~ 
10 arc-minutes wide fluctuation at the left part of the cut. 
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Table 2. Standard deviations of the KSZ maps and correlation coefficients between original and reconstructed KSZ 
maps for the same KSZ map with standard deviations ranging from 2.5 10 -7 to 2.0 10 -6 . Two wavelet bases are 
tested. 
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Fig. 8. Top and middle: Histogram and power spectrum 
of the original KSZ map (solid line) , and the reconstructed 
map (dashed line). Bottom panel exhibits the ratio of the 
two power spectra. The standard deviation of the orig- 
inal map is very low (cro=2.5 10~ 7 ). Note the excess of 
near zero values in the histogram of the estimated map 
(logarithmic scale). Note also the very low correlation co- 
efficient 0.48. This is for the worst case. 



4. Discussion 

We present a method for separating the KSZ signal 
from primary CMB anisotropies based on two steps: 1) 
Interpolation and 2) reconstruction. Our results clearly 



depend on the quality of the interpolation used to esti- 
mate the primary CMB signal and thus the KSZ maps. In 
our case this corresponds to the interpolation of a corre- 
lated noise, namely the CMB. The results we present in 
this study seem already very satisfactory but might cer- 
tainly be improved. 

The KSZ reconstruction is based on the set of KSZ 
estimated maps obtained with a specific choice of TSZ 
thresholds. We have used here a rather simple but robust 
method (based on the cumulative distribution function of 
the pixels in the TSZ map) to determine these thresholds, 
more sophisticated methods optimising the series of TSZ 
thresholds need to be investigated. 

Using our straightforward choice of thresholds, we 
have investigated two methods to reconstruct the fi- 
nal KSZ maps: A decorrelation and a minimisation. 
The first method is based on the decorrelation ap- 
proach using the PCA. It significantly underestimates 
the standard deviations of the reconstructed KSZ maps 
as compared to the original signal by 50% on av- 
erage. More sophisticated decorrelation methods can 
also be used. Preliminary tests with the Independent 
Component Analysis (ICA) IjCardoso & Soulamiac 1 993 
|Hyvarinen 1999} give promising results in terms of the 
standard deviations. However, the results obtained from 
the ICA need to be rescaled using external flux constraints 
which are not always (or easily) available in "real-life" . For 
example in our case, we would need to use the fluxes of 
known clusters to calibrate the reconstructed KSZ maps 
on the original signal. Despite this limitation, we will con- 
tinue investigating this method in the future. The decorre- 
lation method is a blind method which advantage is that 
no a priori criteria are needed to obtain the KSZ map. 
However, the resulting maps are of low quality in terms 
of standard deviation. The second reconstruction method 
we use is based on a minimisation technique that takes 
into account the statistical properties of the KSZ signal, 
namely: (i) KSZ dominates over the primary anisotropies 
at small angular scales, and (ii) the KSZ fluctuations fol- 
low a non-Gaussian distribution with a non-zero excess 
kurtosis. In the present study, we use the excess kurto- 
sis of the diagonal wavelet coefficients to characterise the 
non-Gaussian signatures of the KSZ effect. However, we 



11 




2.0-10- 
1.5*10" 
1.0.10" 
5.0x10" 
I 

-5.0x10" 
-1.0x10" 




Pixel NuiiilH-i 



Fig. 9. Cuts across original (solid line) and reconstructed 
(dashed line) KSZ maps, at the same position, but with 
increasing (from top to bottom) original standard devia- 
tion (for a: 2.5 10~ 7 , 5.0 KT 7 , 1.0 10" 6 and 2.0 10~ 6 ). The 
important difference between original and reconstructed 
signals in the middle of the cuts illustrates the fact that 
large scale structures (this one is ~ 50 arc-minutes) are 
poorly resolved due to confusion with CMB fluctuations. 
Note that the largest fluctuation in the left is not recon- 
structed for the lowest a and that the quality of its re- 
construction increases with increasing a. 



vestigation of noise and additional astrophysical contribu- 
tions is quite important but it is beyond the scope of our 
present study. It should a priori be partly treated in the 
first step component separation (from which we obtain 
the observables: y and 5t maps). However, some contri- 
bution from the TSZ signal may remain in the 5t map, be- 
cause of imperfect component separation or when the rel- 
ativistic corrections to the SZ effect are not corrected for, 
for example. This will act as an additional and correlated 
noise. As shown by Diego et al. (2003)) this introduces a 
non-Gaussian signature into the CMB signal and hence 
errors in the KSZ reconstruction. This non-Gaussian con- 
tribution due to the TSZ effect is characterised by a non- 
zero skewness. We can account for this source of corre- 
lated noise and thus correct for it, either at the interpola- 
tion stage with the additional constraint that the skewness 
should be zero (which is the case for the primary CMB 
anisotropies) , or in the minimisation procedure using a 
generalised criterion including the skewness as well as the 
excess kurtosis. Another way to overcome this difficulty, 
is to apply our method to the individual frequency maps 
and correct for any TSZ spurious contribution. As a mat- 
ter of fact, the technique we present can be applied to 
separate TSZ fluctuations from the primary fluctuations. 
The correlation between two frequency channels, where 
TSZ dominates, gives indeed a first order spatial template 
which can be used to obtain the TSZ signal and thus to 
predict the primary CMB. At this point, the first order y 
map can be used in the next step to better estimate, in an 
iterative way, the TSZ signal itself. We plan to investigate 
this method in the future. As for the instrumental noise, 
it can be taken into account in the interpolation step by 
relaxing the parameter A (Eq. 2). However, if the noise is 
not white then other interpolation methods might have to 
be used (Sec I2.1.1|) for discussion. Another way to deal 
with the noise is to minimise not on the non-Gaussian 
character of the KSZ, but rather on the statistical proper- 
ties of the remainder (i.e. CMB+noise+other components) 
at scales where CMB dominates. We will then obtain an 
estimate of all the components except KSZ that can then 
be subtracted to the total signal. 



could generalise the minimisation criterion to include the 
third moment (skewness) in order to account for, and 
thus separate, between different processes contributing to 
the signal and having different non-Gaussian characters. 
The minimisation method we propose gives reconstructed 
KSZ maps that are in quite good agreement with the orig- 
inal signal with an average correlation coefficient between 
original and reconstructed KSZ map of 0.78, and an error 
of 5% in the standard deviation of the reconstructed KSZ 
maps. However, the minimisation method depend greatly 
on the minimisation criteria and therefore on an a priori 
knowledge of the reconstructed signal. 

The results presented here are based on an ideal case 
where only the two signals CMB and SZ are taken into 
account. This simplified test case allows us to investigate 
the ultimate intrinsic limitations of the method. The in- 



The present work is based on the use of a spatial tem- 
plate to separate KSZ temperature fluctuations from the 
primary fluctuations. This point has been already noted 



by Haehnelt fc Tegmark (1996)| who used an X-ray emis- 
sion template to measure the peculiar velocity of clus- 
ters. The choice of the spatial template is an important 
issue for our method since it is used to define the mask 
and hence the interpolated regions. The spatial template 
should then be the closest possible to the signal (SZ effect 
in our case). The optimal choice is really to use the TSZ 
template itself (similarly to the commonly used matched 
filter approach). Since SZ traces the intra-cluster gas, we 
could also use the X-ray emission of clusters as a template. 
The problem in this case is that the X-ray emission scales 
with the product nlT^ 2 , whereas the TSZ scales with 
n e T e , and consequently the spatial extension of clusters is 
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underestimated by taking X-ray templates. Additionally, 
the X-ray observations of galaxy clusters are restricted to 
a rather small fraction of objects not too distant to suf- 
fer from the dimming effect and with high enough intra- 
cluster temperatures to be detected. The TSZ effect on the 
contrary is redshift independent and less sensitive to the 
gas parameters. Moreover, using the TSZ map as a tem- 
plate in our method has the advantage of evaluating the 
temperature fluctuations (even very low amplitude ones) 
associated with KSZ in the map without resorting to the 
knowledge or the measurement of the cluster parameters 
(n e , T e ). The method presented here has proven its success 
in achieving this goal. In particular, Sec. [3] illustrates how 
well is the KSZ map reconstructed when the input KSZ 
signal is decreased by one order of magnitude in terms of 
standard deviation. 



5. Conclusion 

In this first attempt to extract a map of the KSZ tem- 
perature fluctuations from the CMB anisotropics we use 
a method which is based on very simple and minimal as- 
sumptions. We discuss the issue of noise and astrophysical 
contributions but we do not take them explicitly into ac- 
count. Therefore, our results show the intrinsic limitations 
of the method in terms of reconstructing a KSZ map from 
a mixture of CMB and KSZ anisotropics. We demonstrate 
that the 15 KSZ reconstructed maps are in quite good 
agreement with the original input signal with a correla- 
tion coefficient between original and reconstructed maps 
of 0.78 on average, and an error on the standard deviation 
of the reconstructed KSZ map of only 5% on average. 

To achieve these results, we use the hypothesis that 
a first step component separation provides us with: (i) a 
map of Compton parameters for the TSZ effect of galaxy 
clusters, and (ii) a map of temperature fluctuations for the 
primary CMB + KSZ cluster signal. Our method essen- 
tially takes benefit from the spatial correlation between 
KSZ and TSZ effects towards the same galaxy clusters. 
This correlation allows us to use the TSZ map as a spa- 
tial template in order to mask, in the CMB + KSZ map, 
the pixels where the clusters must have imprinted an SZ 
fluctuation. In practice a series of TSZ thresholds is de- 
fined and for each threshold, we estimate the correspond- 
ing KSZ signal by interpolating the CMB fluctuations on 
the masked pixels. The series of estimated KSZ maps fi- 
nally is used to reconstruct the KSZ map through the 
minimisation of a criterion taking into account two sta- 
tistical properties of the KSZ signal (KSZ dominates over 
the primary anisotropies at small scales, KSZ fluctuations 
are non-Gaussian distributed). 
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